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WERE DIVIDED INTO 3 GROUPS WHICH RECEIVED 1 OF 3 TRAINING 
PROCEDURES— (1) FADING-NO DELAY t (2) CONSTANT-NO DELAY t AND 
(3) CONSTANT-DELAY. THE STIMULI WERE A HORIZONTAL AND A 
VERTICAL LINE PROJECTED ONTO A PLEXIGLASS SCREEN. THE 
RESPONSE CONSISTED OF PUSHING ON THE SCREEN t AND THE S4> 
STIMULUS WAS REINFORCED WITH CANDY. IN THE INITIAL SESSION 
THE S-i* STIMULUS WAS PROJECTED 5 TIMES TO ESTABLISH THE 
RESPONSEt AND THEN THE S- STIMULUS WAS INTRODUCED IN A RANDOM 
FASHION. IN THE FADING PROCEDUREt ITS INTENSITY AND DURATION 
WERE GRADUALLY INCREASED UNTIL IT MATCHED THAT OF THE S-f 
STIMULUS. IN THE CONSTANT PROCEDURE IT WAS KEPT AT THE SAME 
INTENSITY AS S*. IF THERE WAS NO RESPONSE TO S-t IT 
TERMINATED IN 5 SECONDS. IN THE DELAY CONTINGENCY* 5 SECONDS 
HAD TO ELAPSE WITHOUT A RESPONSE BEFORE IT WOULD TERMINATE. 

IN THE NO-DELAY CONTINGENCY A RESPONSE IMMEDIATELY TERMINATED 
ITt AND IT WAS THEN REPROJECTED. THE SUBJECTS WERE TRAINED 
UNTIL THEY WENT THROUGH ONE SESSION OF 20 PRESENTATIONS OF 
EACH STIMULUS WITH NO S- RESPONSES. SUBJECTS TRAINED WITH THE 
FADING PROCEDURE MADE SIGNIFICANTLY FEWER RESPONSES IN THE 
COURSE OF TRAINING THAN SUBJECTS TRAINED WITH THE CONSTANT 
PROCEDURE. SUBJECTS TRAINED WITH THE CONSTANT-DELAY PROCEDURE 
HADE SIGNIFICANTLY FEWER RESPONSES THAN THE SUBJECTS TRAINED 
WITH THE CONSTANT-NO DELAY PROCEDURE. IN THE LATTER. IT WAS 
FELT THAT THE OFFSET OF THE S- STIMULUS WITH A RESPONSE WAS 
ACTING AS A REINFORCEMENT. (DR) 
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Extinction in Discrimination Learning: Presentation^and 

Contingency Variables and Associated Side Effects 
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Miriam Cohen , Robert Glaser, and James G. Holland 

Abstract 

The purpose of the study was to assess the effects of two methods 
of stimulus presentation (fading and constant) and two response contin- 
gencies (delay and no— delay) on the course of discrimination learnings 
and to ne the effects of response histories on various aspects of 
discrimination performance. In the fading procedure S- was gradually 
faded along the dimensions of brightness and time, and in the constant 
procedure S— maintained a constant value throughout training. Subjects 
trained with the constant procedure made significantly more S- responses 
in the course of acquisition than subjects trained with the fading pro- 
cedure. In the constant-delay procedure, S- responses were followed by 
a delay in the offset of S-. In the constant-no-delay procedure S- re- 
sponses were followed by an Intertrial Interval. Subjects trained with 
the constant-no-delay procedure made significantly more responses to S- 
than subjects trained with the constant-delay procedure. 

The amount of extinction which occurred during learning was highly 
correlated with Intertrlal responding and the stability of the learned 
discrimination. Discrimination reversal learning was also a function of 
original learning history. The results were discussed in terms of the 
differential processes which underlie discriminative performance when 
different training procedures are used. 

Introduction 

The classical view that extinction of S- responses is required for 
discrimination learning has been reexamined by Terrace (1963). His success 
in establishing discriminative behavior in pigeons with minimal or no S- 
respondlng raises questions about the variables which control the occurrence 
of S- responses. The emphasis in previous work on the necessity for error 
responses in discrimination learning has resulted in some neglect of the var- 
iables which control their occurrence. 
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Recent studies have shown that when a "fading" procedure Is employed, 
fewer S- responses occur In the course of acquisition of the discrimination 
than when a constant method Is used (Moore & Goldlamond, 196A; Ray, 1967; 
Sldman & Stoddard, 1967; and Terrace, 1963). In the fading procedure stim- 
ulus control Is transferred from one set of stimuli to another; stimulus 
support Is gradually withdrawn from a previously learned, or easily estab- 
lished discrimination, and control Is transferred over successive trials to 
a new stimulus dimension. An analysis of the events following an S- response 
further Indicates that when this response Is followed by a delay in the off- 
set of S-, there Is less possibility for S- responses to be adventitiously 
reinforced by the onset of a new event (Sldman, 1960, p.> 351). In uhe pre- 
sent study the effects of stimulus presentation methods (fading and constant) 
and response contingencies (delay and no-delay) on the course of S- respond- 
ing In discrimination learning were Investigated. The joint effect of manip- 
ulating these two variables was to provide subjects who had acquired the 
same discrimination performance but had different behavioral histories with 
regard to the amount of S- responding or extinction that had occurred in 
the course of acquiring the discrimination. 

With respect to the effect of this behavioral history on discrimina- 
tion performance there is evidence to suggest that the effects observed in 
extinction with a single stimulus are also found when extinction occurs In 
discrimination learning. Terrace's (1966b) results indicated that spontane- 
ous recovery of an extinguished response and concomitant side effects occur 
when the discrimination Is learned with extinction. The mechanism underlying 
these effects seems to be Inhibition. This was suggested by several kinds of 
evidence: the effect of the tranqulllzlng drugs which released this Inhibi- 

tion (Terrace, 1963), the demonstration of inhibitory gradients surrounding 
S- (Terrace, 1966a), and the peak shift In the postdls crimination generaliza- 
tion gradients around S+ (Terrace, 1964). Terrace (1963) also found that S- 
responses were accompanied by Inter trial responses. In contrast. In a dis- 
crimination acquired without extinction there was no evidence of the occur- 
rence of these phenomena. These differences indicate that extinction In dis- 
crimination learning can have auxiliary side effects which are not present 
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when a discrimination is acquired without extinction. The present study 
investigated the effects of different amounts of extinction in discrimin- 
ation learning on Irrelevant Intertrlal behavior accompanying learning, 
the stability of the acquired discrimination, and the amount of Inhibition 
associated with S-. 



Method 

Independent Variables 

Stimulus presentation . A fading procedure was employed in which S- 
was Introduced to the subject in two phases during the first experimental 
session. During Phase I, S- was of a constant duration, 0.5 seconds, and 
its brightness was gradually Increased from very dark to full brightness, 
l.e., a level equal to S+» At the beginning of Phase II, S- was again made 
very dark and both the brightness and duration of S- were Increased simul- 
taneously. S- was dark initially and on successive presentations it became 
brighter and its duration was Increased progressively from one to five sec- 
onds. The last four S- stimuli in the first session were of full bright- 
ness and full duration. There was no further fading of S- after the first 
session. The brightness and duration of S- on successive presentations dur- 
ing the fading sequence are presented in Table 1, in which Phase I and 
Phase II of the fading sequence are further subdivided into stages which 
will be referred to in the results section. This fading sequence is the 
sequence that was presented to those subjects who made no S- responses. If 
a subject made an S- response during the fading sequence, the same S- was 
presented again. 



Insert Table 1 about here 



A constant method of stimulus presentation was employed in which S- 
was as bright as S4> and of maximum duration (five seconds) on the first trial 
and on all subsequent presentations. 

Reinforcement contingencies . The events which followed a correct 
response were the same in all of the training procedures. A response to 
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S+ and no response to S- constituted correct responses. When a response was 
aade to S+, the subject received an Mfilf candy, the stimulus was terminated, 
and a new stimulus was presented after an Intertrlal Interval. When no re- 
sponse was made to S- for five seconds, the stimulus was terminated, and a 
new stimulus was presented after the Intertrlal Interval. 

The events which followed an Incorrect response differed In the dif- 
ferent training procedures. No response to S+, or a response to S- consti- 
tuted an Incorrect response. The event which followed an Incorrect response 
was either a delay or no delay In the offset of the stimulus. When the delay 
contingency was In effect, each response to S- reset a five-second timer. 

S- could not be terminated until five seconds without a response to S- had 
elapsed. S+ could not be terminated until a response to S*!* had occurred. 

Thus , every trial ended with a correct response and a new stimulus was pre- 
sented after the Intertrlal Interval. In the no-delay contingency each 
response to the S- terminated the S-. The same S- was presented again after 
the Intertrlal Interval. S+ was terminated after five seconds If the sub- 
ject had not responded, and the same S-l* was presented again after the Inter- 
trlal Interval. In this procedure the offset of S+ and S- was not contin- 
gent on the occurrence of a correct response. 

The two methods of stimulus presentation and the two reinforcement 
contingencies were manipulated In three different training procedures: (1) a 

fadlng-no-delay procedure, (2) a constant-no-delay procedure, and (3) a 
constant-delay procedure. 

Subjects 

TWenty-seven kindergarten children were assigned to one of the three 
discrimination procedures. Four other subjects, for particular reasons to 
be discussed later, were trained first with one procedure, and then changed 
to another procedure. 

Apparatus 

The subject was seated In front of an aluminum panel 17 x 21 Inches 
which had a circular aperture 5 1/2 Inches In diameter. The stimuli, black 
lines on a white or gray background, were projected with a Kodak Carousel 
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slide projector. Model #550, on a plexiglass screen mounted behind the 
aperture. ^ The black lines were 4 3/4 Inches long and 1/8 inch wide. In 
the discrimination task the S+ stimulus was a vertical line and the S- 
was a horizontal line. The subject responded by pressing on the plexi- 
glass screen and his response was recorded on an Esterline Angus event 
recorder. Model AW, which provided a trlal-by-trlal record of ongoing be- 
havior. The reinforcement for S+ responses consisted of M&M's dispensed 
into a cup by an HMD-1 candy dispenser made by Davis Scientific Instruments. 
Between trials a shutter closed and the screen was dark. Any responses 
which occurred during this period were recorded. The shutter, MfiM dis- 
penser, and the back of the stimulus display unit were covered by a sound- 
proofed masonite box. Relay equipment housed in an adjacent room was used 
to coordinate and control stimulus presentations and response contingencies. 

Training Procedures 

The following general procedures were employed for all three methods 
of discrimination training. When the subject first entered the room, the 
instructions to him were, "Press the window, and see if you can get an M&M." 
When the next stimulus appeared, he was told, "press the window," only if 
he had not already done so. No further instructions were given in any of 
the sessions after the first session. The first five stimuli presented to 
the subject in the first session were S+ stimuli and each response to the 
S*f* was followed immediately by an MAM. After the first five S+ stimuli had 
been presented, S+ and S- were presented in a random sequence on successive 
trials, and the three different training procedures were put into effect. 

On the first day of training all subjects had a total of 55 stimuli (32 S+ 
and 23 S- stimuli). In all subsequent sessions there were 40 stimuli (20 
S+ and 20 S- stimuli). All subjects were trained until they reached a cri- 
terion of one session (40 stimuli) with no responses to S-. After each sub- 
ject had reached this criterion, he was given two post-criterion sessions. 

Between each of the first five successive stimulus presentations 
there was an intertrlal interval which was gradually lengthened., Responses 
to the dark screen during this interval were considered a measure of irrele- 
vant responding in the course of discrimination learning. In order to elim- 
inate (or shape out) these responses, the interval was a variable interval 
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with a range of one to four seconds and a mean of about two seconds. If a 
response occurred at any time during the Interval it: 1) stopped the tape 

which was timing the variable interval, and 2) tripped off another timer 
idiich added ten seconds to the remaining time in the interval. 

Posttraining Procedures 

After all sid>jects, nine in each of the three training procedures, 
had acquired the discrimination and had two postcrlterlon sessions, three 
subjects from each of the groups were assigned to one of three posttrain- 
ing groups. One group was given a retention test after two weeks, consist- 
ing of a single session with 20 S+ and 20 S- stimuli, and procedures iden- 
tical to those used during original training were employed. The other two 
groups were given discrimination reversal training, either immediately fol- 
lowing or two weeks after the last postcrlterlon session. In reversal train- 
ing the new S+ was a horizontal line (the old S-) and the new S- was a 
vertical line (the old S+). For all subjects, during reversal training the 
delay contingency was in effect. In the first day of reversal training, 
the new S+ was presented first. Latency of response to the new S+ was used 
as the index of Inhibition. The S+ remained on the screen until the subject 
pressed the window. Since it had been observed that some subjects would begin 
to walk out of the room rather than press the window, a procedure was employed 
to try and prevent the subjects from leaving the room. If the subject began 
to leave the room a "free" M&M was dispensed. The free M&M fell into the cup 
in the presence of the new S+ without the subject having to press the window. 
The significant aspect of leaving the experimental situation is that it might 
be Inferred that it represents one extreme of the continuum of Inhibition; so 
much Inhibition was associated to the new S+ that the subject walked out of 
the room to avoid a stimulus which had been associated with extinction. The 
other extreme of this continuum would be no Inhibition to S+ in which case the 
subject would be expected to respond immediately to the new S+. The second 
stimulus presented in reversal training was another new S+. The stimulus re- 
mained on the screen until the subject pressed the window or started to leave 
the room, at which time he was given another free M&M. Five S+ stimuli were 
given in succession at the beginning of the first day of reversal training. 

The number of stimuli presented were the same as in original training. 
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Results 

S- Responding In Discrimination Learning 

A comparison of the groups trained with the three different proce- 
dures Indicates that there were significant differences In the total number 
of S- responses that each group made In the course of discrimination learn- 
ing. Table 2 shows the number of S- responses made by each subject* and the 
total number of S- responses for each of the three groups. An analysis of 
variance Indicates that the differences between the groups were significant 
beyond the .01 level. 



Insert Table 2 about here 



Fading 

Several different fading procedures had been tried In pilot work 
before a program was established In which a discrimination could be acquired 
with minimal S- responding. The development of the task and the fading pro- 
cedure Is described elsewhere (Cohen, 1967). Table 3 shows the data for the 
fadlng-no-delay group. The first column labeled Session 1 gives the number 
of S- responses that each subject made during each stage of the fading sequence. 
As shown In Table 1, the different stages refer to successive approximations 
to the terminal S-. The few S- responses that were made In stages 1, 2, 3, 
and 4 Indicate that fading on the dimensions of brightness and time was ef- 
fective In controlling the occurrence of S- responses. Since no S- responses 
occurred In Stage 5, It can be concluded that stimulus control had been trans- 
ferred from brightness and time to the dimension of line orientation. The 
remainder of the columns In Table 3 show the number of S— responses that each 
subject made In each quarter-session after the fading sequence prior to 
reaching criterion. A quarter-session consisted of five S- stimuli present- 
ed In random sequence with S+ stimuli. The total number of S- responses that 
each subject made before reaching criterion Is given In the right-hand column 
of Table 3. The data clearly Indicate that the fading procedure employed was 
effective In establishing and maintaining the vertical-horizontal line dis- 
crimination with little or no S- responding. 
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Insert Table 3 about here 



Cons tant-delav 

When the course of extinction for each subject trained with the 
constant-delay procedure was plotted according to the number of S- responses 
that occurred in successive quarter-sessions a pattern of S- responding 
emerged which was the same for all of the subjects trained with this pro- 
cedure. Figures 1, 2, and 3 show this typical pattern. The greatest num- 
ber of S- responses occurred in the first quarter of the first session. On 



Insert Figures 1, 2, and 3 about here 



a given day a greater number of S- responses usually occurred in the first 
and second quarters of the session; few, and often no S- responses occurred 
in the third and fourth quarters of that session. On the following day, S- 
responses would occur again in the first and/or second quarter of the session. 
The number of S- responses in the first or second quarter was usually less 
than the number of S- responses which had occurred on the previous day in the 
same quarter session. The general pattern of the extinction curves indicates 
that there was extinction of S- responses within sessions and spontaneous 
recovery between sessions. 

Cons tant-no-delav 

When the course of extinction for each subject trained with the con- 
st ant-no- delay procedure was plotted according to the number of S- responses 
that occurred in successive quarter-sessions, a pattern of S- responding 
emerged which was the same for all subjects. Figures 4, 5, 6, and 7 show 
this pattern. The maximum number of S- responses fell in some quarter after 
the first quarter session. For some subjects this peak quarter session fell 



Insert Figures 4, 5, 6, and 7 about here 
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on the first day end for other subjects the peak quarter fell in some session 
after the first day. In quarter-sessions before the peak quarter session, S- 
responses increased in successive quarter-sessions. After the peak quarter, 

S- responses decreased in successive quarter-sessions. For two subjects, Tim 
and Kirk (shown in Figure 7) , S- responses had not extinguished after 16 ses- 
sions, at which time training was terminated. The increase in S- responding 
before it began to decrease suggested that S- responses were being reinforced 
in some way. It seems likely that the stimulus change from a lighted screen 
to a dark screen immediately following an S- response was acting as a rein- 
forcer. The data also indicate that if stimulus change was acting as a rein- 
forcer, it lost its property as a reinforcer, since S- responding did decline. 

Cons tan t-no-de lav Changed to Constant-delay 

Four subjects who began training with the constant-no-delay procedure 
showed little evidence of learning after five days and on the sixth day they 
were switched to the constant— delay procedure. After the change in proce- 
dure, two of these subjects acquired the discrimination and the two others 
had not reached criterion after 16 sessions, at which time training was term- 
inated. The total niimber of S- responses that each subject made during train- 
ing is given in Table 2. 

The course of S- responding in each training session for each subject 
is shown in Figures 8 and 9. The solid line between Sessions 5 and 6 indi- 
cates the change in procedure. Figures 8 and 9 show that S- responding was 
being maintained at a relatively stable rate or it was increasing in the 
first five sessions of training when the constant-no-delay procedure was in 
effect. When the delay procedure was instituted in Session 6, there was a 



Insert Figures 8 and 9 about here 



marked drop in the number of S- responses that occurred. This immediate drop 
can be attributed to the change in procedure, since the rate of S- responding 
had been fairly stable before the delay procedure was Introduced. Although 
Figures 8 and 9 are not plotted by quarter sessions, the maximum number of S- 
responses occurred in the first quarter-session after the change to the delay 
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procedure. The data obtained with this change in procedure give further 
support to the explanation that S- responses were being reinforced by 
stimulus change In the constant-no-delay procedure. S- responses must 
have been reinforced during the first five sessions to have been maintained 
at such a high rate. As soon as the delay procedure was Instituted, and 
an S- response was no longer followed by a stimulus change, S- responses 
began to extinguish. 

Auxiliary Effects of Extinction 

In order to examine the byproducts of extinction, comparisons were 
made between two groups, the fading group which acquired the discrimination 
with few or no extinction trials and the constant-delay group which acquired 
the discrimination with extinction. The data of the subjects trained with 
the constant-no-delay procedure were treated separately, since "S-" responses 
In this procedure did not seem to constitute extinction trials. 

Effect of S- Responding 

Intertrlal responding . No Intertrlal responses occurred for any sub- 
ject until after S- responses had occurred. When all 18 subjects In the two 
groups were rank ordered In terms of the number of S— and Intertrlal re- 
sponses that they made during acquisition, there was a correlation of .90 
(p <.001). Subjects trained with the fading procedures made few Intertrlal 
responses and subjects trained with the constant— delay procedure made many 
Intertrlal responses. When the data were analyzed In terms of the number of 
Intertrlal responses that followed correct and Incorrect responses It was 
found that 84Z of the total number of Intertrlal responses for all subjects 
followed S- responses. The other 162 followed a correct response, which was 
either a response to S’!* or no response to S-. These results Indicate that 
when little or no extinction occurred during discrimination learning, the 
reinforced response was under precise stimulus control. It can also be con- 
cluded from this evidence that one of the byproducts of extinction was the 
occurrence of Irrelevant Intertrlal behavior. 

Postcrlterlon performance . For the 18 subjects a rank difference cor- 
relation of .79 (p <.01) was obtained between the total number of S- responses 
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that each subject made during acquisition and the two immediate postcrl- 
terlon sessions. Retention measures were obtained for six subjects » three 
from each of the two groups. The rank difference correlation for these 
subjects between S-> responses during acquisition and retention testing two 
weeks after the final postcriterion session was .82 (p <.05). It can be 
concluded from this evidence that when S- responses have been extinguished 
in discrimination learning, it is likely that there will be spontaneous re- 
covery of the extinguished response. When S- responses have not occurred 
in original learning, and hence, have not been extinguished they are not 
likely to occur in postcrlterlon performance. 

Discrimination reversal learning . The purpose of the reversal task 
was to assess the inhibitory properties of a stimulus that had been associ- 
ated with extinction. It had been hypothesized that those subjects who had 
acquired the original discrimination with extinction would have Inhibition 
associated to the old S- (new S+), and those subjects who had not undergone 
extinction would have less Inhibition to the same stimulus. Response latency 
proved to be too variable a measure and no quantitative evaluation of the 
amount of inhibition associated to the new S+ could be made. However, two 
qualitative aspects of the subject's behavior in reversal learning were 
noteworthy. Table 4 shows those subjects who responded to the first S+ in 
reversal learning and those subjects who had to be given a free M&M before 
they responded to the new S+. In the immediate reversal group, subjects 
originally trained with fading procedures responded to the new S+. This 
suggested that these subjects had little Inhibition associated to the stimu- 
lus. Those subjects originally trained with the constant-delay procedure 



Insert Table 4 about here 



did not respond to the first new Si* before being given a free M&M, suggesting 
that Inhibition was present. In the delayed reversal group, two of the three 
sid>jects originally trained with the constant-delay procedure responded to 
the first new S+. This might be Interpreted to mean that inhibition had dis- 
sipated over the two weeks and that there was spontaneous recovery of the 
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extinguished response. However, there is no apparent explanation for the 
fact that the subjects originally trained with the fading procedures did 
not respond to the new S4* in the delayed reversal task. 

The right hand column of Table 4 Indicates those subjects who com- 
pleted reversal training and those who stopped coming to the laboratory 
before they reached criterion. The five subjects who would not continue 
training had all originally been trained with the constant-delay procedure. 
All subjects originally trained with the fading procedures completed re- 
versal training. Subjects in the fading group were undergoing extinction 
for the first time and subjects in the constant-delay procedure were under- 
going extinction for the second time. The data suggest that continued ex- 
perience with extinction procedures may cause the experimental situation 
to become aversive. 

Effects of Responding; A Reinterpretation 

Since responses to the horizontal line, originally designated as S-, 
were apparently being reinforced in the constant-no-delay procedure, this 
stimulus can no longer be properly referred to as S-. S- is used to indi- 
cate a stimulus, in the presence of which there is no reinforcement (Keller 
& Schoenfeld, 1950, p. 118). Therefore the horizontal line used in the con- 
st ant-no-delay procedure will be referred to as "S-." 

Intertrial responding . A rank difference correlation of .72 (p <.05) 
was obtained between "S-" and intertrial responding for the nine subjects 
trained with the constant-no-delay procedure. Of the Intertrlal responses, 
56Z followed "S-" responses and 44% followed either an S+ response or no 
response to "S-." Since the same response was reinforced in the presence of 
two different stimuli, both S+ and "S-," and Intertrlal responses occurred 
with approximately equal frequency following S+ and "S-" responses, it is 
possible that intertrlal responses were the result of stimulus generaliza- 
tion in the constant-no-delay procedure. 

Postcriterion performance . For the seven subjects given the two post- 
criterion sessions (two subjects did not reach criterion), there was a rank 
difference correlation of .74 (p <.05) between the number of "S-" responses 
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during acquisition and the postcrlterlon sessions* This recurrence of 
"S-" responses following the criterion session may indicate that stimulus 
change had regained its power as a relnforcer after a period of time* 

Discrimination reversal learning * Since "S-" responses in original 
learning had been reinforced, there was no basis for postulating that 
there was any Inhibition associated to the new S*f* Table 5 indicates that 
all subjects in the constant-no-delay group responded to the first new 
apparently because it had been previously associated with reinforcement* 
Table 5 also indicates that all subjects continued training until they 
reached criterion* It is possible that subjects in this group were willing 
to complete the task because they had no prior history of extinction* 



Insert Table 5 about here 



Discussion 

Stimulus Factors Influencing S- Responding 

In this study it was demonstrated that techniques similar to those 
used by Terrace (1963) also controlled the occurrence of S- responses when 
children learned a discrimination (by the successive method) bet«'een a 
vertical and a horizontal line* A question raised by these findings is: 
why does the fading procedure control the occurrence of S- responses? 

Terrace (1963) postulated that the reason pigeons did not respond to the 
S- when fading procedures were employed was that they had been adventitiously 
reinforced for not responding to S- by the onset of S+* In the present 
study the Intertrlal interval was gradually lengthened during the early 
trials and the response of not responding to the dark screen was reinforced 
by the onset of SH** When the first S- in the fading sequence came on the 
screen it was dark and 0*5 seconds in duration; it looked similar to the 
dark screen during the Intertrlal Interval* The subjects generalized to 
not responding to the initial S-, since they had already learned not to re- 
spond to the dark screen* In the first phase of fading, S- became progressively 
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brighter, and the stimulus remained constant at 0.5 seconds in duration. 

The behavior of not responding in Phase I, as S- became brighter, was 
probably maintained because of the brevity of S-, which made a response 
to it almost impossible. At the beginning of Phase II, S— was 0.5 sec- 
onds in duration and dark; as brightness and time Increased, S- respond- 
ing still did not occur. From the data it was not possible to determine 
whether brightness or time or both were controlling the occurrence of S— 
responses or whether different dimensions were controlling the occurrence 
of S- responses in different subjects. Brightness has been an effective 
fading dimension in several discrimination training procedures (Moore & 
Goldlamond, 1964; Sldman & Stoddard, 1967; Terrace, 1963, 1964). How- 
cvBTf the critical properties of a stimulus dimension which makes it ef- 
fective for controlling responding to a negative stimulus have not been 

determined. 

Response Contingencies Influencing S- Res ponding 

The results of this study suggest that an event following an S- 
response, with the exception of the continued presence of S-, may adven- 
titiously reinforce the response. When the delay contingency was in ef- 
fect, S- responses decreased progressively in successive sessions indicat- 
ing that they were not being reinforced when they were followed by a delay 
in the offset of S-. It is probable that variations in the delay of the 
offset of S- (following a response to it) would influence the course of 
extinction. Long periods of delay might, as Blough (1966) has suggested, 
prolong a period of nonreinforcement to the extent that it may be punishing. 

When a response to "S-” was not followed by a delay, "S-” responding 
Increased in frequency. The Increase in frequency of "S-" responses in the 
beginning of training can be interpreted to mean that the response-contingCint 
light termination (the onset of the inter trial Interval following an S- re- 
sponse) acted as a relnforcer for all subjects. Antonltls and Barnes (1961) 
found that lever pressing Increased in a group operant procedure with kin- 
dergarten children when it was followed by light termination. Similar re- 
sults have been obtained with rats (Leaton, Symmes and Barry, 1963; and 
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Roberts, Marx, and Collier, 1958). These results support the sensory rein- 
forcement hypothesis which postulates that response-contingent stimulation 
in many modalities is reinforcing (Kish, 1966). 

The wide range in the total number of "S-" responses that occurred 
for subjects trained with the constant-no-delay procedure (see Table 2) 
suggests that the power of light termination as a relnforcer varied con- 
siderably from subject to subject. The fact that two subjects were still 
responding to "S-" after 16 sessions indicates that in these two cases 
stimulus change was an effective relnforcer. However, with continued train- 
ing, "S-" responses decreased for all subjects in the constant -no-delay 
group indicating that light termination was no longer acting as a reln- 
forcer. Studies which have used light onset (Kish and Baron, 1962; and 
Roberts, Marx and Collier, 1958) and other sensory relnforcers have also 
found that prolonged exposure to a sensory relnforcer leads to a weakening 
of its reinforcing effects; with lack of exposure, the stimulus change 
recovers its reinforcing property (Kish, 1966). 

S- Responding and "S-'* Responding 

Similar terminal behavior — differential responding to vertical and 
horizontal llnes-^as observed for the subjects trained with the constant- 
delay and constant-no-delay procedures. A similar degree of stimulus con- 
trol, as indicated by Intertrlal responses, the absence of S- or "S-" re- 
sponses in criterion performance, and the recurrence of S- or "S-" responses 
in postcrlterlon performance, was observed for the subjects trained with 
both procedures. However, on the basis of the results of this study, it 
can be postulated that this similar behavior was the result of different 
underlying processes: extinction in the constant-delay situation, and sati- 

ation in the constant-no-delay situation. In the constant-delay procedure, 
the absence of S- responses may have been due to Inhibition which was built 
up to a stimulus that was associated with non-reinforcement as the result 
of extinction. The recurrence of S- responses (spontaneous recovery) was 
due to the dissipation of Inhibition associated with that stimulus. For 
those subjects trained with the constant-no-delay procedure, the absence of 
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"S-" responses was due to satiation on the reinforcing stimulus* The re- 
currence of "S-" responses was due to a recovery from the satiating effects 
of a sensory reinforcer* One method of testing this conjecture is to ex- 
amine the generalization gradients surrounding S-* If the S- and "S-" 
stimuli have different properties in these two cases, the shape of the 
gradients should differ* The generalization gradients for those subjects 
who are reinforced for "S-" responses should be a typical generalization 
gradient, in which the peak number of responses occur at "S-" and respond- 
ing decreases as the stimulus varies along the relevant dimension* The 
generalization gradient for those subjects who are not reinforced for S- 
responses should be an Inverted U* That is, few responses should occur at 
S-, and as the stimulus departs from S-, responses should Increase, since 
S- is controlling the tendency not to respond* Carrying out this experi- 
ment would require that S+ and S- ("S-") be on different contlnua so that 
responding to S- ("S-") is not confounded with the excitatory tendency 
surrounding S+ (Jenkins, 1965; Terrace, 1966a)* Such a study would facil- 
itate an analysis of the processes underlying discrimination learning when 
different training procedures are used to establish the discrimination* 
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TABLE 1 



Brightness and Duration Values of S- 


during the Fading Sequence 


Phase 1 


Duration 


Brightness 




(seconds) 


(foot candlei 


Stage 1 


S- 


0.5 


8.33** 


S- 


0.5 


9.00 


S- 


0.5 


10.33 


S- 


0.5 


11.00 


S- 


0.5 


13.00 


Stage 2 


S- 


0.5 


13.66 


S- 


0.5 


14.66 


S- 


0.5 


15.33 


S- 


0.5 


16.50 


S- 


0.5 


17.25 


Phase 11 
Stage 3 


S- 


1.0 


8.33 


S- 


1.5 


9.00 


S- 


2.0 


10.33 


S- 


2.5 


11.00 


S- 


3.0 


13.00 


Stage 4 


S- 


3.5 


13.66 


S- 


4.0 


14.66 


S- 


4.5 


15.33 


S- 


5.0 


16.50 


Stage 5 


S- 


5.0 


17.25*^ 


S- 


5.0 


17.25 


S- 


5.0 


17.25 


S- 


5.0 


17.25 



*The values given are an approxinate measure of incident light. 
^Very dark. 

^Full brightness. 



TABLE 2 



Nunber of S> Responses during the Course of Acquisition 


for the Different 


Training Procedures 


Training Procedure 


Number of S- Responses 


Fading 


Alexandra 


0 


Mary Beth 


0 


Artemlse 


0 


Michael D. 


2 


Greg 


2 


Connie 


2 


Frank 


3 


Jackie 


8 


Mark 


JB 

25 ~ Total 


Cons tant-delay 


Philip 


22 


Jessica 


22 


Laurie 


24 


Helen 


25 


Angela de P. 


27 


Tony 


37 


Angela G«. 


60 


Eric 


65 


Ernie 


79 

361 - Total 


Cons tant>no>delay 


Mary Jane 


51 


Michael S. 


53 


Kim 


58 


Susan 


64 


Charles 


89 


Ann 


104 


Jonathan 


274 


Kirk® 


512a 


Tima 


1147 a 

2352 - Total 



Cons tant-no-delay > 
Cons tan t-de lav 



Janet 


427 


Paul 


514 


Michael 0.® 


547® 


David® 


642® 



The Occurrence of S- Responses for Subjects Trained with the Fading Procedure 
Session 1 Session 2 Session 3 Session 4 Session 5 Session 
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TABLE 4 



Responses during Original and Reversal Learning 
for Subjects in the Fading and Constant*-*delay Groups 



Original 

Training 



Subject S- Responses Trial 1 Response Completion of 
in Original to S*(* in Reversal Training in 
Learning Learning Reversal 

Learning 



F Alexandra 0 
F Michael D. 2 
F Frank 3 
CD Philip 22 
CD Angela de P. 27 
CD Ernie 79 



R® 

R 

R 

f" 

F 

F 




F Mary Beth 
F Greg 

F Mark 

CD Jessica 

CD Tony 

CD Eric 



0 

2 

8 

22 

37 

65 



F 

F 

F 

R 

F 

R 



C 

C 

C 

C 

C 

C 



response to the first new S*f. 
free M&M dispensed, 
completed reversal training, 
did not complete reversal training. 



msrnswismmm 



iPPPi 
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TABLE 5 



Responses during Original and Reversal Learning 





Original 

Training 


Subject 


S- Responses 
in Original 
Learning 


Trial 1 Response 
to S*t* in Reversal 
Learning 


Completion of 
Training in 
Reversal ■ 
Learning 


Immediate 


CND 


Kim 


58 


R^ 




Reversal 


CND 


Charles 


89 


R 


c 




CND 


Ann 


104 


R 


c 


Delayed 


CND 


Mary Jane 


51 


R 


c 


Reversal 


CND 


Susan 


64 


R 


c 




CND 


Jonathan 


274 


R 


c 



^R ■ response to the first new S*f« 
■ completed reversal training. 



S>RESPONSES 




Figure 1 

Constant-delay: S- responses in acquisition. 
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Figure 2 

Constant-delay: S- responses in acquisition. 
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Figure 3 

Constant-delay: S- responses in acquisition 



o 

ERIC 






1 



26 



KIM 




Figure 4 

Constant-no-delay: S- responses in acquisition. 




SESSIONS 



Figure 5 

Constant-no-delay: S- responses in acquisition. 




Figure 6 

Constant-no-delay: S- responses in acquisition. 
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SESSIONS 



Figure 7 

Constant-no-delay: S- responses in acquisition. 




Figure 8 

Constant-no-delay changed to constant-delay: 
S- responses in acquisition. 




SESSIONS 

Figure 9 

Constant-no-delay changed to constant-delay: 
S- responses in acquisition 
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